AITopics

2412.14724

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California (0.04)
Europe > Portugal (0.04)

Genre: Research Report (0.64)

Industry: Law (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Qasem, Rabee, Hendi, Mohannad, Tantour, Banan

ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine

arXiv.org Artificial IntelligenceDec-19-2024

Large language models (LLMs) have gained significant attention over the past few years, particularly following the emergence of ChatGPT, both from researchers Movva et al. [2024] and in terms of their adoption in the private sector. This hype has revolutionized how we use AI in different fields and helped redefine how various domains utilize AI, such as in medicine Alghamdi and Mostafa [2024], Yuan et al. [2024], finance Xie et al. [2024], Malaysha et al. [2024], and even agriculture Gupta et al. [2024]. These advancements demonstrate the profound potential of AI to transform industries, driving innovation and efficiency in ways previously unimaginable. However, one domain that still has room for growth and the potential to bring about significant change is the legal domain Martin et al. [2024], Maree et al. [2024]. Although sectors such as healthcare and finance have rapidly adopted AI to address their unique challenges, the legal industry has been relatively slow to embrace these technologies Legg and Bell [2020]. The complex nature of legal language, coupled with jurisdictional variations and the high stakes involved, and the lack of AI regulations de Almeida et al. [2021], Nadjia [2024], has presented significant obstacles to developing effective AI-powered legal

arxiv preprint arxiv, large language model, machine learning, (18 more...)

2412.14771

Country:

Asia > Middle East > Palestine (0.84)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Research Report (1.00)

Industry:

Law > Statutes (0.49)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Medlin, Karen, Leyffer, Sven, Raghavan, Krishnan

A Bilevel Optimization Framework for Imbalanced Data Classification

arXiv.org Machine LearningDec-16-2024

Data rebalancing techniques, including oversampling and undersampling, are a common approach to addressing the challenges of imbalanced data. To tackle unresolved problems related to both oversampling and undersampling, we propose a new undersampling approach that: (i) avoids the pitfalls of noise and overlap caused by synthetic data and (ii) avoids the pitfall of under-fitting caused by random undersampling. Instead of undersampling majority data randomly, our method undersamples datapoints based on their ability to improve model loss. Using improved model loss as a proxy measurement for classification performance, our technique assesses a datapoint's impact on loss and rejects those unable to improve it. In so doing, our approach rejects majority datapoints redundant to datapoints already accepted and, thereby, finds an optimal subset of majority training data for classification. The accept/reject component of our algorithm is motivated by a bilevel optimization problem uniquely formulated to identify the optimal training set we seek. Experimental results show our proposed technique with F1 scores up to 10% higher than state-of-the-art methods.

artificial intelligence, datapoint, machine learning, (17 more...)

2410.11171

Country:

North America > United States > North Carolina (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > South Korea > Gyeongsangnam-do > Changwon (0.04)

Genre:

Research Report > Promising Solution (0.48)
Research Report > New Finding (0.48)

Industry:

Law Enforcement & Public Safety > Fraud (0.47)
Government > Regional Government (0.47)
Health & Medicine (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.72)

arXiv.org Artificial IntelligenceDec-19-2023

Describing Robots from Design to Learning: Towards an Interactive Lifecycle Representation of Robots

Qiu, Nuofan, Wan, Fang, Song, Chaoyang

As autonomous machines capable of interacting with the real world, various types of robots, such as wheeled mobile robots, quadrupedal robots, and humanoid robots, are emerging in domestic, factory, and other environments to collaborate with humans or accomplish tasks independently. The morphology of a robot is the essential factor that most directly affects the robot's configuration space, thereby determining the robot's function [1]. Robot morphology is primarily determined during the design process, thanks to the development of computer-aided design (CAD) technology, which makes it cost-effective, time-saving, and efficient compared to the manufacturing process. Beyond robot morphology, learning has become an essential topic in robotics because it enables robots to achieve complex tasks and, thus, better interact with the environment. However, training robots in hardware may lead to failures or damage, making it expensive and time-consuming.

fusion 360, information, robot description format, (15 more...)

2312.12295

Country:

Asia > China > Guangdong Province > Shenzhen (0.05)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.40)

Industry: Information Technology (0.35)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Yousif, Amira, Buckley, James

Impact of Sentiment Analysis in Fake Review Detection

arXiv.org Artificial IntelligenceDec-17-2022

Fake review identification is an important topic and has gained the interest of experts all around the world. Identifying fake reviews is challenging for researchers, and there are several primary challenges to fake review detection. We propose developing an initial research paper for investigating fake reviews by using sentiment analysis. Ten research papers are identified that show fake reviews, and they discuss currently available solutions for predicting or detecting fake reviews. They also show the distribution of fake and truthful reviews through the analysis of sentiment. We summarize and compare previous studies related to fake reviews. We highlight the most significant challenges in the sentiment evaluation process and demonstrate that there is a significant impact on sentiment scores used to identify fake feedback.

artificial intelligence, machine learning, natural language, (15 more...)

2212.08995

Country: North America > United States > Ohio > Montgomery County > Dayton (0.05)

Genre: Research Report (0.83)

Industry: Information Technology > Services (0.48)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
(2 more...)

arXiv.org Artificial IntelligenceDec-17-2022

A Layered Architecture for Universal Causality

Mahadevan, Sridhar

We propose a layered hierarchical architecture called UCLA (Universal Causality Layered Architecture), which combines multiple levels of categorical abstraction for causal inference. At the top-most level, causal interventions are modeled combinatorially using a simplicial category of ordinal numbers. At the second layer, causal models are defined by a graph-type category. The non-random ``surgical" operations on causal structures, such as edge deletion, are captured using degeneracy and face operators from the simplicial layer above. The third categorical abstraction layer corresponds to the data layer in causal inference. The fourth homotopy layer comprises of additional structure imposed on the instance layer above, such as a topological space, which enables evaluating causal models on datasets. Functors map between every pair of layers in UCLA. Each functor between layers is characterized by a universal arrow, which defines an isomorphism between every pair of categorical layers. These universal arrows define universal elements and representations through the Yoneda Lemma, and in turn lead to a new category of elements based on a construction introduced by Grothendieck. Causal inference between each pair of layers is defined as a lifting problem, a commutative diagram whose objects are categories, and whose morphisms are functors that are characterized as different types of fibrations. We illustrate the UCLA architecture using a range of examples, including integer-valued multisets that represent a non-graphical framework for conditional independence, and causal models based on graphs and string diagrams using symmetric monoidal categories. We define causal effect in terms of the homotopy colimit of the nerve of the category of elements.

artificial intelligence, category, machine learning, (19 more...)

2212.08981

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > New York (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Immunology (0.46)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)

Dagli, Rishit, Shaikh, Ali Mustufa

CPPE-5: Medical Personal Protective Equipment Dataset

arXiv.org Artificial IntelligenceDec-15-2021

We present a new challenging dataset, CPPE - 5 (Medical Personal Protective Equipment), with the goal to allow the study of subordinate categorization of medical personal protective equipments, which is not possible with other popular data sets that focus on broad level categories (such as PASCAL VOC, ImageNet, Microsoft COCO, OpenImages, etc). To make it easy for models trained on this dataset to be used in practical scenarios in complex scenes, our dataset mainly contains images that show complex scenes with several objects in each scene in their natural context. The image collection for this dataset focusing on: obtaining as many non-iconic images as possible and making sure all the images are real-life images unlike other existing datasets in this area. Our dataset includes 5 object categories (coveralls, face shield, gloves, mask, and goggles) and each image is annotated with a set of bounding boxes and positive labels. We present a detailed analysis of the dataset in comparison to other popular broad category datasets as well as datasets focusing on personal protective equipments, we also find that at present there exist no such publicly available datasets. Finally we also analyze performance and compare model complexities on baseline and state-of-the-art models for bounding box results. Our code, data, and trained models are available at https://git.io/cppe5-dataset .

category, dataset, detection, (11 more...)

2112.09569

Country:

North America > United States > California (0.04)
Asia > India > Maharashtra > Mumbai (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > China (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Public Health > Disease Control (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Both, Gert-Jan, Kusters, Remy

Temporal Normalizing Flows

arXiv.org Machine LearningDec-19-2019

Analyzing and interpreting time-dependent stochastic data requires accurate and robust density estimation. In this paper we extend the concept of normalizing flows to so-called temporal Normalizing Flows (tNFs) to estimate time dependent distributions, leveraging the full spatio-temporal information present in the dataset. Our approach is unsupervised, does not require an a-priori characteristic scale and can accurately estimate multi-scale distributions of vastly different length scales. We illustrate tNFs on sparse datasets of Brownian and chemotactic walkers, showing that the inclusion of temporal information enhances density estimation. Finally, we speculate how tNFs can be applied to fit and discover the continuous PDE underlying a stochastic process.

arxiv, density estimation, tnf, (14 more...)

1912.09092

Country:

Europe > France > Île-de-France > Paris > Paris (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (0.53)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Mehdiyev, Rashid, Nava, Jean, Sodhi, Karan, Acharya, Saurav, Rana, Annie Ibrahim

Topic subject creation using unsupervised learning for topic modeling

arXiv.org Machine LearningDec-18-2019

We describe the use of Non-Negative Matrix Factorization (NMF) and Latent Dirichlet Allocation (LDA) algorithms to perform topic mining and labelling applied to retail customer communications in attempt to characterize the subject of customers inquiries. In this paper we compare both algorithms in the topic mining performance and propose methods to assign topic subject labels in an automated way.

matrix, nmf, topic term, (14 more...)

1912.08868

Country:

North America > Canada > Quebec > Capitale-Nationale Region > Québec (0.04)
North America > Canada > Quebec > Capitale-Nationale Region > Quebec City (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(2 more...)

Genre: Research Report (0.50)

Industry:

Media (0.47)
Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Lee, Taehee, Lawrence, Charles E.

Heteroscedastic Gaussian Process Regression on the Alkenone over Sea Surface Temperatures

arXiv.org Machine LearningDec-18-2019

To restore the historical sea surface temperatures (SSTs) better, it is important to construct a good calibration model for the associated proxies. In this paper, we introduce a new model for alkenone (${\rm{U}}_{37}^{\rm{K}'}$) based on the heteroscedastic Gaussian process (GP) regression method. Our nonparametric approach not only deals with the variable pattern of noises over SSTs but also contains a Bayesian method of classifying potential outliers.

ecember 20, outlier, regression, (14 more...)

1912.08843

Country:

North America > United States > Rhode Island (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.50)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)